Automated System for Improving RSS Feeds Data Quality
نویسنده
چکیده
Nowadays, the majority of RSS feeds provide incomplete information about their news items. The lack of information leads to engagement loss in users. We present a new automated system for improving the RSS feeds’ data quality. RSS feeds provide a list of the latest news items ordered by date. Therefore, it makes it easy for a web crawler to precisely locate the item and extract its raw content. Then it identifies where the main content is located and extracts: main text corpus, relevant keywords, bigrams, best image and predicts the category of the item. The output of the system is an enhanced RSS feed. The proposed system showed an average item data quality improvement from 39.98% to 95.62%.
منابع مشابه
Recommendation of Personalized Rss Feeds Based on Ontology Approach and Multi-agent System in Web 2.0
Nowadays, multi-agent systems (MAS) are used in many fields such as industry, education, finance, etc. MAS counts among the most promising technological paradigms in the development of Web applications. They can contribute significantly to improving the quality of the use of these applications. In this paper, we propose a new recommendation approach for personalized RSS (Really Simple Syndicati...
متن کاملFeedTree: Sharing Web Micronews with Peer-to-Peer Event Notification
Syndication of micronews, frequently-updated content on the Web, is currently accomplished with RSS feeds and client applications that poll those feeds. However, providers of RSS content have recently become concerned about the escalating bandwidth demand of RSS readers. Current efforts to address this problem by optimizing the polling behavior of clients sacrifice timeliness without fundamenta...
متن کاملRSS Feed Recommendation
Introduction Really Simple Syndication (RSS) Feeds allows users to access blogs and articles in an easy to read format. It cuts out the overhead of navigating websites for content and allows users to get information more quickly. Currently, the user is in total control of their RSS feeds, adding and deleting feeds according to their tastes. This requires the user to actively search out RSS feed...
متن کاملMatt Fuller
Traditionally users subscribe to RSS feeds of interest using an RSS feed reader. The RSS feed reader periodically polls the subscribed feeds for updates or items to be displayed to the user. Many RSS feeds usually pertain to a single news source or blog. Others may aggregate various feeds usually on some topic and produce a single RSS feed. Middleware publishsubscribe systems allow users to sub...
متن کاملSentiment Analysis for Effective Stock Market Prediction
The Stock market forecasters focus on developing a successful approach to predict stock prices. The vital idea to successful stock market prediction is not only achieving best results but also to minimize the inaccurate forecast of stock prices. This paper attempts to design and implement a predictive system for guiding stock market investment. The novelty of our approach is the combination of ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1504.01433 شماره
صفحات -
تاریخ انتشار 2015